Skip to content

docs: record offload CLI + #307/#316 status in roadmap docs#319

Merged
xiaguan merged 1 commit into
mainfrom
docs/roadmap-drift-post-316
Jun 9, 2026
Merged

docs: record offload CLI + #307/#316 status in roadmap docs#319
xiaguan merged 1 commit into
mainfrom
docs/roadmap-drift-post-316

Conversation

@xiaguan

@xiaguan xiaguan commented Jun 9, 2026

Copy link
Copy Markdown
Collaborator

Roadmap-doc drift fixes for work that already landed (#307 batched greedy sampling, #316 in-process pegaflow KV offload). Doc-only.

  • offload integration doc — TL;DR/§0 "未接 server CLI" → wired (#316: --kv-offload / --kv-offload-host-gib / --no-prefix-cache, plain + LoRA); §3 table / §4 route corrected so Qwen3-4B reads as already shipped and Kimi-K2 as the next candidate (the route changed mid-flight but those rows still said "Kimi 首发").
  • execution.md — new "KV data plane (pegaflow)" entry under cross-model infra; #307/#316 added to the Qwen3-4B Done list.
  • qwen3 roadmap — capability table gains a batched-greedy row (#307) and an L2-offload row (#316); Now#2 marked phase-1-done; TL;DR notes what landed since the 2026-06-04 verification.
  • index.md — offload row CLI status.

🤖 Generated with Claude Code

Post-#316 drift fixes (re-landed after the openinfer rename):
- offload integration doc: "未接 server CLI" → wired (#316), and the §3/§4
  Kimi-first/Qwen-next labels corrected (Qwen3-4B actually shipped first).
- execution.md: KV data-plane entry under cross-model infra + #307/#316 in
  the Qwen3-4B Done list.
- qwen3 roadmap: batched-greedy (#307) and L2 offload (#316) rows + TL;DR note.
- index.md: offload row CLI status.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
@xiaguan xiaguan merged commit 8f72023 into main Jun 9, 2026
1 check passed
@xiaguan xiaguan deleted the docs/roadmap-drift-post-316 branch June 9, 2026 14:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant